Zarrabizadeh 23 

WATERMARKING SCHEME FOR DIGITAL VIDEO 

Cross-Reference to Related Applications 

U.S. patent applications Serial No. (Zarrabizadeh 22) and Serial No. 
(Zarrabizadeh 24) were filed concurrently herewith. 

5 Technical Field 

This invention relates to the art of watermarking digital video, and more 
particularly, to including additional information within the digital video information. 

Background of the Invention 

Watermarking of video signals is, generally, the inclusion within the video itself 

10 of additional information. This can be useful to provide an embedded identification of 
the source of a video, to keep track of where and for how long a video is played, and to 
communicate information via the video to an ancillary device. Prior art techniques for 
watermarking video signals typically encoded the additional information in an analog 
format within the video itself using the luminance of the video to carry the additional 

15 information. However, the human visual system is very sensitive to the luminance signal, 
and so a person viewing a watermarked signal easily perceives distortion which is caused 
by the changes made to the video signal to convey the additional information when there 
is an attempt to increase the bit rate of the additional information beyond a certain point, 
e.g., beyond 120 bits per second. Thus, although the prior art's techniques of 

20 watermarking of video signals has had some success in certain applications, such success 
has been limited by the extremely small bit rate that is achievable without perceivable 
distortion by a person viewing the video signal carrying the additional information. 

In previously filed United States Patent Application Serial No. 10/342704, which 
is incorporated by reference as if set forth fully herein, I, along with my coinventor, 

25 recognized that the human visual system is much less sensitive to chrominance than to 
luminance. Therefore, we developed a system for digital watermarking a video signal 
that inserts the additional information of the watermarking signal on the chrominance 
component of the video signal rather than on its luminance signal. Thus, the additional 
information is "impressed" upon the chrominance component of the video signal. 

30 Advantageously, although there may be significant distortion of the chrominance 
component, especially when the additional information has higher bit rates than is 
achievable without perceivable distortion by the prior art, nevertheless such distortion 
will not be detected by the human visual system, provided it is appropriately managed. 
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Thus, the additional information can have a higher bit rate as compared with that 
achievable by the prior art, e.g., bit rates greater than 150 bits per second can be achieved. 
Further advantageously, the additional data can be recovered from the video signal even 
after the video signal watermarked with the additional data is compressed using the 
5 Motion Picture Expert Group (MPEG)-l and MPEG-2 encoding systems. 

I have recognized that the system of United States Application Serial No. 
10/342704 required multiple frames of the video to accurately transmit the information on 
the chrominance portion. This can be disadvantageous in some applications, e.g., where 
finer granularity of the watermarking is required so as to provide better response time and 

10 improved resistance to temporal tampering with the video, and under certain conditions, 
e.g., when there is a scene change, which can occur rapidly in the case of high speed 
movement such as a chase scene. 

In concurrently filed United States Application Serial No. (Case Zarrabizadeh 22) 
the watermarking of video is improved by having one or more bits of watermark data 

15 carried via an average value of the chrominance component of each of various blocks of 
the video signal, on up to a per-frame basis. More specifically, one or more bits of the 
watermark data may be effectively placed into specified bit positions of the average value 
of at least a selected portion of the chrominance component of up to each block of up to 
each frame. Note that typically, there are two chrominance portions, e.g., U and V when 

20 the video is represented in the YUV format, in the chrominance component. More 
specifically, each block of each frame of the original video signal may be modified to 
carry its own independent one or more bits of the watermark data in the average value of 
a chrominance portion that is selected to carry the watermark data for that block. 
Conceptually then, the value of the bit position of the average value of the selected 

25 chrominance portion that is to contain the watermark data for a block may be thought of 
as being replaced by the value of the bit of watermark data to be carried in that block. 

Only one of the chrominance portions carries watermark data for any particular 
block. The chrominance portion selected to carry the watermark data for any block may 
be independently selected for that block. The bit position of the average value that is 

30 replaced is one of the bits of the integer portion of the average value. 

If necessary, the values of the selected chrominance portion of individual pixels of 
a block may be adjusted in order to cause the bit of the average value of the selected 
chrominance portion of the block that is to carry the watermark data to be the same as the 
value of the watermark data bit. This may be achieved by changing the value of the 

35 selected chrominance portion of various pixels in the block such that over the entire block 
the average of the value of the chrominance portion is changed so that the value of the 



D:\PATENTS\Zarrabizadeh 23\Zarrabizadeh 23.doc 



2 



Zarrabizadeh 23 

select bit position of the average conforms to the value of the watermark data that is being 
placed in the selected bit position. 

If the value of the bit of the average value to contain the watermark data is already 
the same as the value of the watermark data bit, no change may be performed to any of 
5 the pixels of the block. However, if the value of the bit of the average value to contain 
the watermark data is the complement of the value of the watermark data bit to be carried 
by the block, at least the minimum change to the average value that will cause the bit to 
be changed to the value of the watermark data bit is performed on the average value. For 
example, if the bit of the average value to contain the watermark data is the second least 

10 significant bit of the integer portion of the average value, such a bit may always be 
changed to its complementary value by either adding or subtracting one to the average 
value. Doing so is preferable to adding two, which may also be used to always change 
the bit to its complementary value, because it introduces less change in the value of the 
average, and hence less change in the block, thereby reducing the chance of introducing a 

15 viewer-perceivable artifact. The change to the average value of the selected chrominance 
portion of a block is implemented by adding or subtracting — , which may be 
accomplished by adding negative numbers — , a value to the selected chrominance portion 
of ones of the pixels of the block until the desired change in the average thereof is 
achieved. 

20 When using block-based frequency domain encoding of the video, such as one of 

the motion picture experts group (MPEG) standards, e.g., MPEG-1, MPEG-2, MPEG-4, 
the substitution of the bit of watermarked data may be achieved by adjusting the value of 
the DC coefficient, which corresponds to the average value, of at least one of the 
chrominance matrices for the block. For example, the second least significant bit of the 

25 DC coefficient for a block is replaced with the value of the watermark bit that is desired 
to be impressed on the block. 

Which bit of the average value of the chrominance portion is designated to carry 
the watermark data may be a function of a texture variance of the block. It is 
advantageous to increase the significance of the bit position carrying the watermark data 

30 as the texture variance increases, because MPEG-like coding employs greater 
quantization step sizes for higher texture variances, and the use of such greater 
quantization step sizes could result in the elimination, e.g., filtering out, of the 
watermarking data if it is not positioned significantly enough. When using more 
significant bit positions, the values to be added or subtracted from the average value in 

35 order to change the value of a bit position carrying the watermark data to its 
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complementary value may be greater than one. Any texture variance may be used, e.g., 
the texture variance of Y, U, or V, or a combination thereof. 

Whether or not the bit position carrying the watermark data was changed to its 
complement, a "margin" value may be added to the average value in order to better 
5 ensure that the bit of watermark data carried by the average value of the block survives 
any MPEG-like encoding, while minimizing the likelihood of perceivable artifacts 
resulting. 

A receiver determines which of the chrominance components of a block are 
carrying the watermark data and extracts the bit of watermark data from the selected bit 

10 position of the integer portion of the average value of that chrominance component. The 
selected bit position may be determined from a texture variance of the block, e.g., the 
texture variance of the determined chrominance portion of the block or the texture 
variance of the luminance component. 

Advantageously, better response time and improved resistance to temporal 

15 tampering with the video is achieved with respect to the prior art. Further 
advantageously, scene changes do not introduce errors into the impressed data. Yet an 
additional advantage is that even if the original pixel domain version of the video is not 
available, but only a block-based frequency domain encoded version thereof, the video 
may be watermarked without conversion back to the pixel domain. 

20 Summary of the Invention 

While concurrently filed United States Application Serial No. (Case Zarrabizadeh 
22) provides significant advantages heretofore explained, I have recognized that with 
certain color combinations there is still, disadvantageously, the possibility that the 
additional data could cause a slightly visible flickering. Furthermore, in United States 

25 Application Serial No. (Case Zarrabizadeh 22) there is no indication as to the quality of 
the extracted data at the receiver. 

Therefore, in accordance with the principles of the invention, prior to any data 
being impressed upon the average value of a chrominance portion of a block, the data is 
replicated, at least once, and preferably two or more times. The original and each 

30 produced replica is transmitted in the same block position of separate frames using the 
techniques of concurrently filed United States Application Serial No. (Case Zarrabizadeh 
22). The frames that have like-positioned blocks that are carrying the same data are 
considered to be a group, and, preferably, the frames thereof are in consecutive in display 
order. 
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Furthermore, in accordance with an aspect of the invention, specific blocks of the 
frame may be embedded with a particular known data sequence, e.g., a Barker sequence, 
rather than encoded user data. In accordance with another aspect of the invention, the 
specific blocks that are embedded with a known sequence may be scattered throughout 
5 the blocks of a frame. Each group may employ a different known sequence, or the same 
sequence may be used for different consecutive groups. 

In accordance with an aspect of the invention, instead of simply repeating the data 
for each like-positioned block of a group, the amount added to the average value for each 
block based on its complexity may be changed slightly from frame to frame for like 

10 positioned blocks over the group, even when the complexity of the blocks is the same. 
Doing so provides additional coding gain that is advantageous to improve the reliability 
of the data at the receiver. However, doing so may cause a slight reduction in the visual 
quality of low texture areas, because a few pixels within the block may have different 
values than their predecessors in the same location. Nevertheless, because such visual 

15 quality reduction is at the pixel level, it is typically not noticeable. 

At a receiver, the multiple instances of the same data bit in successive frames are 
extracted and combined to form a single received bit. In accordance with an aspect of the 
invention, the known data sequence can be used to determine the quality of each of the 
particular frames. The determined quality is used to specify a weight for the frame, and 

20 the values of the extracted data from each frame may be treated as soft data that is 
weighted by its associated weight as part of the combining process. If the determined 
quality of a particular frame is below a prescribed threshold, it may be assumed that the 
particular frame does not contain any watermarking data and no data is extracted for that 
frame. 

25 Brief Description of the Drawing 

In the drawing: 

FIG. 1 shows an exemplary transmitter for digital watermarking a video signal; 

FIG. 2 shows an exemplary receiver for recovering the additional data of a video 
signal containing digital watermarking on the chrominance signal thereof; 
30 FIGs. 3 A and 3B, when connected together as shown in FIG. 3, show an 

exemplary process for use in watermarking one of the chrominance portions with 
additional data; 

FIGs. 4A and 4B, when connected together as shown in FIG. 4, show an 
exemplary process for extracting the additional information from a digitally watermarked 
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video signal in which the additional information that constitutes the watermarking signal 
within the video signal has been impressed upon the chrominance component; 

FIG. 5 shows an example of several safe ranges where the desired bit position is 
the third least significant bit; 
5 FIG. 6 shows an exemplary process for determining which particular chrominance 

portion is more suitable, and so should be selected, to contain the watermarking 
information for a pixel; 

FIG. 7 shows a cutaway view of a portion of an exemplary divided colorspace; 

FIG. 8 shows another exemplary process by which the particular chrominance 
10 portion is selected to contain the watermarking information for a pixel; 

FIG. 9 shows an exemplary transmitter in which flickering may be reduced, in 
accordance with the principles of the invention, by replicating the data to be impressed, at 
least once, and preferably two or more times, prior to its being impressed upon the 
average value of a chrominance portion of a block; and 
15 FIG. 10 shows an exemplary embodiment of a receiver for use in receiving a 

watermarked video signal, such as that produced by the transmitter of FIG. 9, in 
accordance with the principles of the invention. 

Detailed Description 

The following merely illustrates the principles of the invention. It will thus be 

20 appreciated that those skilled in the art will be able to devise various arrangements which, 
although not explicitly described or shown herein, embody the principles of the invention 
and are included within its spirit and scope. Furthermore, all examples and conditional 
language recited herein are principally intended expressly to be only for pedagogical 
purposes to aid the reader in understanding the principles of the invention and the 

25 concepts contributed by the inventor(s) to furthering the art, and are to be construed as 
being without limitation to such specifically recited examples and conditions. Moreover, 
all statements herein reciting principles, aspects, and embodiments of the invention, as 
well as specific examples thereof, are intended to encompass both structural and 
functional equivalents thereof. Additionally, it is intended that such equivalents include 

30 both currently known equivalents as well as equivalents developed in the future, i.e., any 
elements developed that perform the same function, regardless of structure. 

Thus, for example, it will be appreciated by those skilled in the art that any block 
diagrams herein represent conceptual views of illustrative circuitry embodying the 
principles of the invention. Similarly, it will be appreciated that any flow charts, flow 

35 diagrams, state transition diagrams, pseudocode, and the like represent various processes 
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which may be substantially represented in computer readable medium and so executed by 
a computer or processor, whether or not such computer or processor is explicitly shown. 

The functions of the various elements shown in the FIGs., including any 
functional blocks labeled as "processors", may be provided through the use of dedicated 
5 hardware as well as hardware capable of executing software in association with 
appropriate software. When provided by a processor, the functions may be provided by a 
single dedicated processor, by a single shared processor, or by a plurality of individual 
processors, some of which may be shared. Moreover, explicit use of the term "processor" 
or "controller" should not be construed to refer exclusively to hardware capable of 

10 executing software, and may implicitly include, without limitation, digital signal 
processor (DSP) hardware, network processor, application specific integrated circuit 
(ASIC), field programmable gate array (FPGA), read-only memory (ROM) for storing 
software, random access memory (RAM), and non-volatile storage. Other hardware, 
conventional and/or custom, may also be included. Similarly, any switches shown in the 

15 FIGS, are conceptual only. Their function may be carried out through the operation of 
program logic, through dedicated logic, through the interaction of program control and 
dedicated logic, or even manually, the particular technique being selectable by the 
implementer as more specifically understood from the context. 

In the claims hereof any element expressed as a means for performing a specified 

20 function is intended to encompass any way of performing that function including, for 
example, a) a combination of circuit elements which performs that function or b) software 
in any form, including, therefore, firmware, microcode or the like, combined with 
appropriate circuitry for executing that software to perform the function. The invention 
as defined by such claims resides in the fact that the functionalities provided by the 

25 various recited means are combined and brought together in the manner which the claims 
call for. Applicant thus regards any means which can provide those functionalities as 
equivalent as those shown herein. 

Software modules, or simply modules which are implied to be software, may be 
represented herein as any combination of flowchart elements or other elements indicating 

30 performance of process steps and/or textual description. Such modules may be executed 
by hardware which is expressly or implicitly shown. 

Unless otherwise explicitly specified herein, the drawings are not drawn to scale. 
In the description, identically numbered components within different ones of the 
FIGs. refer to the same components. 

35 FIG. 1 shows exemplary transmitter 101 for digital watermarking a video signal 

by having one or more bits of watermark data carried via an average value of the 



D:\PATENTS\Zarrabizadeh 23\Zarrabizadeh 23.doc 



7 



Zarrabizadeh 23 

chrominance component of each of various blocks of the video signal, on up to a per- 
frame basis. 

Shown in FIG. 1 are a) YUV demultiplexer (demux) and decimator 103, b) color 
selection 105, c) double-pole, double-throw switch 109, d) texture masking unit 111, 
5 e) multiplier 113, f) adder 115, g) multiplexer (mux) 117, h) bit mapper 123, and 
i) summer 133. Also shown in FIG. 1 are optional j) channel encoder 119, and k) block 
interleaver 121. 

YUV demultiplexer and decimator 103 receives a video signal to be watermarked, 
i.e., to have additional information added thereto. YUV demultiplexer and decimator 103 

10 may work with digital video, e.g., video formatted according to the Serial Digital 
Interface (SDI) standard. As will be recognized by those of ordinary skill in the art, any 
video signal not initially in an appropriate digital format may be converted thereto using 
conventional techniques. 

YUV demultiplexer and decimator 103 demultiplexes the luminance (Y) 

15 component of the video and its chrominance component. The chrominance component of 
the video signal has two portions U and V, where U is the differential blue portion and V 
is the differential red portion. 

Much of the processing to embed the additional data on the chrominance 
component is, preferably, performed with a special decimated video format in which for 

20 each original 2x2 luminance block of video, had the original block been in 4-4-4 
representation, there remains only one Y, one U, and one V value. To this end, in the 
event the input video signal is actually in the so-called 4-4-4 format, the image is 
appropriately decimated by YUV demultiplexer and decimator 103 so that for each 
original 2x2 luminance block there is one Y, one U, and one V value. Similarly, in the 

25 event the input video signal is in the so-called "4-2-2" format, i.e., the luminance is full 
resolution while the chrominance portions are a) full resolution vertically only and b) half 
resolution horizontally, YUV demultiplexer and decimator 103 decimates the luminance 
component horizontally and vertically as well as decimates each chrominance portion 
only vertically. Likewise, in the event the input video signal is in the so-called 4-2-0 

30 format, i.e., the luminance component is full resolution while the chrominance portions 
are each only half resolution both vertically and horizontally, the luminance component of 
the image is decimated by YUV demultiplexer and decimator 103 so that for each 
original 2x2 luminance block had the original block been in 4-4-4 representation there 
remains only one Y, one U, and one V value. 

35 The preferred decimated video format may be supplied as an output to color 

selection 105. Thus, preferably, regardless of the format of the input video signal, further 
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processing by the system preferably may be based on the decimated video signal such that 
for every 2x2 block of full resolution luminance pixels of the original input video signal 
there is one Y, one U, and one V value. Those of ordinary skill in the art will be able to 
develop their own methods, should they choose to do so, of developing one Y, one U, and 
5 one V value for every 2x2 block of luminance pixels. 

In order to know the format of the original video, a) an operator may indicate to 
YUV demultiplexer and decimator 103 the particular format of the video supplied to 
transmitter 101, b) the format of the video may be detected directly from the video using 
conventional techniques, or c) the information may be supplied from a higher layer 
10 processor which is supplying the input video signal. 

YUV demultiplexer and decimator 103 may also supply a second set of YUV 
outputs in the full format of the original input video signal to double-pole, double-throw 
switch 109. 

Color selection 105 determines, for any particular pixel, on which portion of the 

15 chrominance component, i.e., on the U portion or the V portion, a change in value, if 
necessary, may be better accommodated without introducing a visible artifact. Color 
selection 105 is based upon a look-up table as described further hereinbelow. 
Alternatively, it may be based all or in part, on various computations, such as in prior 
United States Patent Application Serial No. 10/342,704. 

20 The output of color selection 105 is also used to control the position of 

double-pole, double-throw switch 109. More specifically, the output of color selection 
105 is set so that double-pole, double-throw switch 109 1) supplies, to adder 115, the 
portion of the chrominance component that has been selected to carry the watermark data; 
and 2) supplies, to YUV multiplexer 1 17, the portion of the chrominance component that 

25 was not selected. The output of color selection 105 is also supplied to multiplexer 117 
and to bit mapper 123 for use as described hereinbelow. 

Texture masking unit 1 1 1 analyzes the texture of the luminance area around each 
pixel in the decimated format supplied as output by YUV demux and decimator 103 to 
determine the maximum change in value that can be accommodated by that pixel without 

30 introducing visible artifacts, and supplies as an output a weight indicative thereof The 
weight value may be coded, e.g., taking integral values from 1 to 5. Other values may be 
used, e.g., experiments have indicated that a value of up to 20 may be used in busy areas 
without visual degradation. The weight is supplied to multiplier 113. Texture masking 
unit 1 1 1 may put out a smaller value than the maximum distortion that can be introduced 

35 into a pixel as will be described hereinbelow. 
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Note that the particular values used are at least partially dependent on the number 
of bits used to represent each Y, U, and V value. For example, the foregoing suggested 
weight values of 1 to 5 , and a weight of even up to 20, are for Y, U, and V being 8 bit 
values. Those of ordinary skill in the art will readily recognize that the values employed 
5 for 8 bits may be scaled to 10 bits by multiplying by 4, e.g., shifting the value to the left 
two times. Likewise, other numbers of bits used for Y, U, and V can be similarly 
accommodated. 

Multiplier 113 multiplies the weight received from texture masking unit 111 by a 
value related to the information to be transmitted as part of this pixel, which is supplied 

10 by bit mapper 123. For example, the value supplied by bit mapper 123 may be -1, 0, or 1. 
The product produced by multiplier 1 13 is supplied to adder 115 and summer 133. 

Texture masking unit 111 is responsive to summer 133. In this regard, as noted, 
texture masking unit 1 1 1 may put out a smaller weight value than the change in value 
that can be introduced into a pixel in the event that it receives a signal to that effect from 

15 summer 133. More specifically, summer 133 adds the values supplied by texture 
masking unit 111 for each block. Summer 133 supplies as an output to texture masking 
unit 111a maximum value that texture masking unit 1 1 1 can use as its output weight for 
the pixel currently being processed. The maximum value supplied by summer 133 is the 
lesser of the a) maximum weight value that can be accommodated by a pixel based on the 

20 texture surrounding it and b) the difference between a value supplied by bit mapper 123 
to summer 133 for the block and the current sum for the block. Thus, once the sum 
equals the value supplied by bit mapper 123 to summer 133 for the block, texture 
masking unit 1 1 1 outputs a zero for each remaining pixel of the block. 

Adder 115 produces a modified chrominance portion by adding the value supplied 

25 by multiplier 113 to the value of the portion of the chrominance which was selected by 
color selection 105 to carry the additional information for the pixel. As indicated, the 
portion of the chrominance that was selected by color selection 105 to carry the additional 
information is passed to adder 115 by double-pole, double-throw switch 109. The 
modified chrominance portion supplied by adder 1 15 is supplied to multiplexer 117. 

30 Texture masking unit 111, multiplier 113, bit mapper 123 and summer 133 

cooperate to effectively upsample the value being added to each pixel of the special 
processing resolution to match the format of the chrominance of the original video signal. 
To this end, the resulting upsampled values may be added to the selected chrominance 
portion of each pixel in the original video signal that corresponds to the location of a 

35 pixel in the special reduced resolution format used for processing. For example, if the 
original video signal is in 4-2-2 format, the values determined to be added to each of the 
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pixels of a block in the special processing format are duplicated on a per-line basis so as 
to create a block of values to be added that has 8 pixels per line and 16 lines per block. In 
this block, each of the lines of the nonoverlapping groups of 2 consecutive lines has 
identical values to be added. Such a block corresponds in size to the original block of the 
5 selected chrominance portion of the original video in 4-2-2 format. Each value of the 
resulting upsampled block is added to the selected chrominance portion of the respective, 
like positioned pixel in the original video signal by adder 115. Those of ordinary skill in 
the art will readily be able to perform similar block conversions for different formats. 
Note that for those pixels of a block that color selection 105 did not determine that the 

10 selected chrominance portion could better accommodate a change, the value added will 
be zero. If the original video signal is in 4-2-0 format, no upsampling is required. 

Alternatively, only the decimated special processing resolution format is 
processed. The resulting modified chrominance portion is then upsampled, e.g., in 
multiplexer 117. However, doing so may result in some degradation of the original video 

15 signal, although such degradation need not be visible. 

Multiplexer 117 receives the original luminance component (Y) and the 
unmodified chrominance portion that was supplied from YUV demultiplexer and 
decimator 103 via double-pole, double-throw switch 109. Multiplexer 117 also receives 
the modified chrominance portion from adder 115. Multiplexer 117 then multiplexes 

20 together the original luminance component (Y), the unmodified chrominance portion, and 
the modified chrominance portion. Multiplexer 117 knows on which lead it receives the 
modified portion of the chrominance component and on which lead it receives the 
unmodified portion of the chrominance component by virtue of receiving the output of 
color selection 105. The resulting video signal is supplied as the watermarked output 

25 video signal. 

Those of ordinary skill in the art will be able to develop embodiments of the 
invention in which the additional data is added to the original chrominance signal portion 
rather than the decimated version thereof, so that upsampling will not be required. 

As indicated above, the binary data value, i.e., 1 or 0, of the additional 
30 information which is to be transmitted for each block may be supplied directly to bit 
mapper 123 for use as the watermark data or it may first be processed to facilitate the 
processing and recovery of the information at the receiver. Such exemplary processing 
may be performed by optional channel encoder 1 19 and block interleaver 121. 

Channel encoder 119 receives the additional data that is desired to be embedded 
35 in the video stream. This data is then encoded, e.g., using a forward error correcting 
coding scheme. Such forward error correcting scheme may be any conventional forward 
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error correcting scheme, such as convolutional encoding, e.g., Viterbi encoding or turbo 
encoding, or it may be any newly developed coding scheme. In one exemplary 
arrangement, convolutional coding of rate one-half is used. As a result of such coding, 
two bits are produced for every bit of the original bit stream. The channel encoded bit 
5 stream is supplied as an output by channel encoder 1 19 to block interleaver unit 121. 

Block interleaver 121 rearranges the order of the bits of the channel encoded bit 
stream in order to randomly distribute the data. Doing so helps reduce the chance that 
adjacent sections of the channel encoded bit stream are lost, e.g., due to bursts of noise or 
other factors, which would then make it difficult to recover such data at the receiver from 

10 the remaining, actually received data. In one arrangement the number of bits that are 
interleaved as a unit is equal to the number of blocks in a frame. A block interleaver may 
be implemented by writing data sequentially to the rows of a block left to right, at the end 
of each row starting again at the leftmost position of the next row down, and then reading 
the data by starting at the leftmost topmost position of the block and reading down a 

15 column until the end of the column is reached at which point reading continues at the top 
of the next column. A block interleaver of 45 rows by 30 columns has proven effective 
for a picture size of 720 by 480 pixels. For different resolutions, those of ordinary skill in 
the art will be readily able to develop comparable block encoders. The interleaved 
channel encoded bit stream is supplied as an output by bit interleaver 121 to bit mapper 

20 123. 

The data bit supplied by block interleaver 121 is impressed as the watermark data, 
under the control of bit mapper 123, upon at least one block of at least one frame of the 
original video signal. Bit mapper 123 controls the insertion of the watermark data into 
one of the bit positions of the average value of at least a selected one of the chrominance 

25 portions of each block upon which the data is to be impressed, thus effectively replacing 
the bit at that bit position. 

For example, when the watermark data is to be carried in the least significant bit 
of the integer portion of the average of the selected chrominance portion of the block, the 
value that needs to be added to the average value is 0 or 1 . Zero is added when the least 

30 significant bit of the integer portion of the average value is already the same as the 
watermark data bit to be carried and 1 is added when the least significant bit of the integer 
portion of the average value is the complement of the watermark data bit to be carried. 
When the watermark data is to be carried in the second to the least significant bit of the 
integer portion of the average of the selected chrominance portion of the block, the value 

35 of the data to be added to the pixel is -1, 0, or 1. Zero is added when the second least 
significant bit of the integer portion of the average value is already the same as the 
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watermark data bit to be carried and 1 or -1 is added when the second least significant bit 
of the integer portion of the average value is the complement of the watermark data bit to 
be carried. Whether 1 or -1 is added depends on which will cause the smallest change to 
the average value while changing the second least significant bit of the integer portion of 
5 the average value to its complement. Using the second to least significant bit the data to 
be embedded is more likely to survive encoding by MPEG or a similar process. When 
the data to be placed in the third to the least significant bit of the integer portion of the 
average of the selected chrominance portion of the block, the value of the data to be 
added to the pixel is —2, -1,0, 1, or 2. Zero is added when the third least significant bit of 

10 the integer portion of the average value is already the same as the watermark data bit to be 
carried and is -2, -1, 1, or 2 is added when the third least significant bit of the integer 
portion of the average value is the complement of the watermark data bit to be carried. 
Whether is -2, -1, 1, or 2 is added depends on which will cause the smallest change to the 
average value while changing the third least significant bit of the integer portion of the 

15 average value to its complement. Using the third to least significant bit the data to be 
embedded is even more likely to survive encoding by MPEG or a similar process to 
achieve adequate results. From the foregoing, those of ordinary skill in the art will 
readily be able to determine the values to be added for more significant bit positions 
which are determined by the user or the system. 

20 To this end, bit mapper 123 develops a value that is distributively added to a 

selected chrominance portion of the pixels of a block such that doing so changes the 
average of the value of that chrominance portion for that block so that the bit supplied by 
block interleaver 121 that is being impressed is placed in a selected bit position of the 
average value of the selected chrominance portion. This value is the value to be added to 

25 the average value of the selected chrominance portion to place the watermark data bit in 
the appropriate bit position multiplied by the number of pixels in a block. In other words, 
the value developed by bit mapper 123 that is to be added to the average of the value of 
that chrominance portion is divided up into smaller values that are added to individual 
pixels of the block, so that the total of the smaller values added to the block divided by 

30 the number of pixels in the block equals the value to be added to the average value of the 
selected chrominance portion. 

The particular bit average of the value of the chrominance portion for that block, 
e.g., the DC coefficient for that chrominance portion, onto which the data supplied by bit 
mapper 123 is impressed, is determined by bit mapper 123. In one arrangement, the 

35 second least significant bit of the DC coefficient for a block is replaced with the particular 
value that is desired to be impressed on the block. In another arrangement, which bit of 
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the DC coefficient that is replaced may be a function of the texture variance of the block. 
It is advantageous to increase the significance of the bit which is replaced as the texture 
variance increases, because the MPEG coding standards employ greater quantization step 
sizes for higher texture variances, and the use of such greater quantization step sizes 
could filter out the watermark data bit if it is positioned in a bit position that is not 
significant enough. When using more significant bits, the values to be added or 
subtracted from the DC coefficient in order to change the bit being substituted to its 
complementary value may be greater than one. To this end, bit mapper 123 receives the 
average variance of the luminance component for the block from texture masking 111, 
and based on the average variance, determines which bit position is to be replaced. The 
greater the variance, the more significant the bit position into which the watermark data is 
placed. 

Bit mapper 123 supplies the data bit from the interleaved channel encoded bit 
stream that is to be communicated for each block of the original video signal at the 
appropriate time for each pixel of the block of the original video signal when that pixel is 
to be incorporated into the watermarked output video signal. Thus, bit mapper 123 takes 
into account the fact that the processing of the video signal is line based, i.e., the 
processing is left to right on a line, then down to the next line and left to right again, 
causing the adjacent pixels of a block to not necessarily be located sequentially in the 
video stream and therefore to not all be processed in time directly one after the other. The 
particular data bit supplied as an output of bit mapper 123 at any time is supplied as an 
input to multiplier 113. 

Using an encoder, such as shown in FIG. 1, a bit rate of around 6,750 bits per 
second, substantially error free, has been achieved for the additional information as 
supplied to channel encoder 119 when the video frame size is 720 x 480 pixels. 

Those of ordinary skill in the art will readily recognize from the above description 
that various ones of the units in FIG. 1 require storage in order to first determine the 
values which must be computed using information from an entire block, e.g., the original 
average value of the block and the average texture variance of the block, and then to 
employ those values in processing the individual pixels. Consequently, there is typically 
a one slice delay, where a slice is a strip of blocks horizontally all the way across a frame. 

FIG. 2 shows exemplary receiver 201 for recovering the additional data of a video 
signal containing digital watermarking on the chrominance signal thereof. Shown in FIG. 
2 are a) YUV demultiplexer (demux) and decimator 203, b) color selection unit 207, 
c) double pole double throw switch 209, d) block variance calculation 211, e) block 
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integrator V 213, f) block integrator U 215, g) bit selection 217, h) deinterleaver 219, and 
i) channel decoder 221. 

YUV demultiplexer and decimator 203, which may be substantially the same as 
YUV demultiplexer and decimator 103 of transmitter 101 (FIG. 1), receives a video 
signal that has been digitally watermarked in that additional information has been added 
thereto on the chrominance component of the signal. YUV demultiplexer and decimator 
203 works with digital video, e.g., formatted according to the serial digital interface 
(SDI). As will be recognized by those of ordinary skill in the art, any video signal not 
initially in an appropriate digital format may be converted thereto using conventional 
techniques. 

YUV demultiplexer and decimator 203 demultiplexes the luminance (Y) 
component of the video and its chrominance component and decimates it to the preferred 
processing format in which for each original 2x2 luminance block of video, had the 
original block been in 4-4-4 representation, there remains only one Y, one U, and one V 
value. In order to know the format of the received video, a) the operator needs to indicate 
to YUV demultiplexer and decimator 203 the particular format of the input video, b) the 
format of the video may be detected directly from the video using conventional 
techniques, or c) the information may be supplied from a higher layer processor which is 
supplying the input video signal. The demultiplexed luminance and chrominance 
components are supplied to color selection 207. In addition, the luminance component is 
supplied to block variance calculation 211, the V chrominance portion is supplied to 
block integrator V 213, and the U chrominance portion is supplied to block 
integrator U 215. Unlike YUV demultiplexer and decimator 103, YUV demultiplexer 
and decimator 203 need not also supply a second set of YUV outputs in the full format of 
the original input video signal. 

Color selection unit 207 determines for each block on which portion of the 
chrominance component, i.e., on the U portion or the V portion, it was likely that the 
additional information was embedded. The output of color selection unit 207 is used to 
control the position of double pole double throw switch 209. More specifically, color 
selection unit 209 selects the chrominance portion U or V, as a function of Y, U, and V, 
as will be described in more detail hereinbelow, on which the additional information was 
likely to have been embedded for this block. In one arrangement, color selection unit 207 
is based on a lookup table. Doing so simplifies the process by avoiding the need for 
YUV to RGB conversion, which might otherwise be necessary. 

Note that the input to color selection unit 207 is individual pixels. Color selection 
unit 207 keeps track of the pixels in each block and combines the individual U or V 
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selection for each pixel in the block. The particular component that has the highest value, 
i.e., was most often selected for the pixels within a block, is determined to be the output 
of color selection 207. The output of color selection unit 207 is then set so that switch 
209 supplies to bit selection 217 the integrated version of the portion of the chrominance 
5 component to which the additional data was determined to have been added. 

Block variance calculation 211 determines the particular bit of the average of the 
value of the selected chrominance portion for that block, e.g., the DC coefficient for the 
selected chrominance portion, that likely contains the impressed data. As noted, in one 
arrangment, bit mapper 123 (FIG. 1) received and employed the average of the variances 

10 of the luminance component of the pixels of the block, to determine which bit position is 
to be replaced with the watermark data bit to be impressed. The greater the variance, the 
more significant the bit position that should be replaced. Block variance calculation 21 1 
(FIG. 2) should base its calculation on the same information used by mapper 123 to 
replicate its determination. The output of block variance calculation 21 1 is supplied to bit 

15 selection 217. 

Block integrator V 213 integrates the values of V over a block, i.e., the values for 
each pixel in a block are combined, e.g., added together. Block integrator U 215 
integrates the values of U over a block, i.e., the values for each pixel in a block are 
combined, e.g., added together. 

20 Bit selection 217 extracts the bit at the bit position specified by block variance 

calculation 211 from the integrated chrominance portion value supplied to it by switch 
209 as the data for the block. 

Deinterleaver 219 reorders the data to undo the effect of block interleaver 121 
(FIG. 1) of transmitter 101. The reordered values are then supplied to channel decoder 

25 221 (FIG. 2), which performs appropriate decoding for a signal that was encoded using 
the type of encoding employed by channel encoder 1 19 of transmitter 101 (FIG. 1). The 
resulting decoded values are supplied by channel decoder 221 (FIG. 2) as the 
reconstructed version of the additional data signal. For further robustness, channel 
decoder 221 may be a so-called "sequence decoder", e.g., a turbo decoder. 

30 FIGs. 3A and 3B, when connected together as shown in FIG. 3, show an 

exemplary process for use in watermarking one of the chrominance portions with 
additional data. For those blocks where the determined bit position is already the same as 
the value to be impressed, the block may be transmitted unmodified. The process of FIG. 
3 may be performed in a system such as is shown in FIG. 1 . 

35 The process may be entered in step 301 when all the pixels of a block are 

available. Part of the processing of FIG. 3 takes place on a block-by-block basis, and part 
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on a pixel-by-pixel basis. The blocks of a frame are indexed using a two-dimensional 
pointer p,q, where p points to the particular horizontal slice of the frame that is being 
processed and q points to the particular column, or vertical slice, of the frame. For 
example, for 720x480 resolution p ranges between 1 and 30 and q between 1 and 45. 
Similarly, the pixels of each block are indexed using a two-dimensional pointer zj, where 
/ points to the particular row within the block that is being processed and j points to the 
particular column within the block that is being processed. For example, in the special 
processing mode employed to impress the data, where each macroblock of original video 
has only a corresponding 8x8 block of Y, U, and V, both / and j range between 0 and 7. 

After entering the process in step 301, several variables that are used in the 
process are initialized in step 303, e.g., countU(p,9)=0, countV(p^)=0, sumU(p,^>=0, 
sumV(p,<7)=0, and var(p,qr)=0. CountU is a running total of how many pixels within the 
block are selected by the color selection process as being suitable for watermarking on the 
U chrominance portion while count V is a running total of how many pixels within the 
block are selected by the color selection process as being suitable for watermarking on the 
V chrominance portion. SumU and sumV are the running total values of U and V 
respectively over all the pixels of the block. Where watermarking is only performed only 
on pixels of the chrominance portion selected for the block, there is no use for the one of 
sumU and sumV that is developed for the chrominance portion that is not selected. 

In step 305, var(p,?), the total of the variance of the luminance for each individual 
pixel within the block, which is, of course, proportional to the average variance of the 
luminance for the block, is computed. To this end, i and j are initially both set to point to 
the first pixel of the block to be processed, e.g., *=0 and j=0. The value of var(p,q), is 
computed by cycling through each pixel of the block, changing the values of / and j as 
appropriate to do so, and adding together the variance of the luminance for each pixel to 
the current total of var(p,q). 

The variance of the luminance for any particular pixel may computed by taking 
the absolute value of the difference in the luminance between the pixel and all of its 
nearest neighbors. Mathematically, where all of the nearest neighbors are within the 
same block, this may be written as 

var(p^)=var(p^)+(| Yjfi? - 1+1 Y™ - |+| tf» ) - Y^;% |+| ]J«> - | + | Y}>f - 

y(P>i) i+l y(P>l) y(P,l) i . | y(P,<i) y(p,q) . , i y{p,q) y(p,<i) , . i y(p,q) y(n,q) h 

/ (/+i,y+i)n / (/j) " / (/+i,»n / (u) ■ / { f >i)H / (/j) "^/-ij+ol* \ J {u) " / (/+i../-d^ 

Those of ordinary skill in the art will readily be able to adapt the foregoing to 
those pixels whose nearest neighbors are in other blocks. Furthermore, for those blocks 
that are near the borders of the frame, and hence have no nearest neighbors, or the nearest 
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neighbors are part of those blocks that are not displayed, the value of such neighbors may 
be considered to be zero. 

Note that not all of a pixel's nearest neighbors need be considered in the variance 
computation and yet sufficiently high quality results can be achieved. More specifically, 
it is advantageous in that computation time for each pixel is reduced by taking only the 
differences of the 4 pixels in the corners of the rectangle surrounding the pixel and 2 of 
the other pixels that form a vertical or horizontal line with the pixel, e.g., the 2 pixels on 
the horizontal line with the pixel. 

Thereafter, conditional branch point 307 tests to determine which particular 
chrominance portion, i.e., U or V, is going to contain the watermark information for the 
block. This is done by evaluating the color selection for each pixel in the block and 
counting the number of pixels within the block that are selected for each chrominance 
portion. The chrominance portion that was selected the most for the block is chosen for 
watermarking. Note that it may be determined that a particular pixel is unsuitable for 
watermarking at all. In such a case, it is not counted towards the total number of pixels 
for either U or V. 

The particular method of determining the color selected to be watermarked for 
each pixel is at the discretion of the implementer. For example, the chrominance portion 
of the pixel with the smallest value is selected. Alternatively, the color selection 
arrangement described hereinbelow is employed. 

Next, the bit position of the average value of the selected chrominance portion 
that will contain the watermarked bit is determined. The bit position is selected so that 
the watermarked bit will survive any subsequent quantization, such as takes place in 
MPEG-like encoding. 

To this end, if the test result in step 307 is that the V chrominance portion is 
selected to be watermarked, control passes to step 309, in which a variable 
watermarkcolor is set equal to V. Thereafter, conditional branch point 323, which tests 
to determine whether the average Y variance over the block, vav(p,q), is greater than a 
first prescribed V threshold tlv, which is the largest V threshold. An exemplary value of 
tlvis 600. 

Note that the particular threshold values used in connection with FIGs. 3 and 4 for 
both U and V are at least partially dependent on the number of bits used to represent each 
Y value, when the average Y variance is compared with the suggested threshold. For 
example, the suggested threshold values herein are for Y being an 8 bit value. Those of 
ordinary skill in the art will readily recognize that the values employed for 8 bits may be 
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scaled to 10 bits by multiplying by 4, e.g., shifting the value to the left two times. 
Likewise, other numbers of bits used for Y, U, and V can be similarly accommodated. 

Instead of using the average Y variance over the block for the various 
comparisons, a different average variance, e.g., the average V variance over the block, 
5 may be calculated and employed. 

If the test result in step 323 is YES, indicating that the variance is large enough 
that the additional data should be encoded on the 5 th least significant bit of the average of 
the V values of the pixels of the block, e.g., the value of int[sumV(p,q)/(number of pixels 
per block)], e.g., int[sumV(p,q)/64], is greater than tlv, control is passed to step 325, in 
10 which a variable m is set equal to 5. 

Note that instead of using the integer function int for rounding, as is used herein, 
any other form of rounding to achieve an integer value may be employed, e.g., always 
rounding up or always rounding to the nearest integer value. 

If the test result in step 323 is NO, indicating that the variance was not large 
15 enough that additional the data should be encoded on the 5 th least significant bit of the 
average value of the V values of the pixels of the block, control passes to conditional 
branch point 329, which tests to determine if the average Y variance over the block, 
var(p,#), is greater than a second prescribed V threshold, t2v, which is the second largest 
V threshold. An exemplary value of t2v is 1 5. 
20 If the test result in step 329 is YES, indicating that the additional data should be 

encoded on the 4 th least significant bit of the average of the V values of the pixels of the 
block, control is passed to step 33 1, in which variable m is set equal to 4. 

If the test result in step 329 is NO, indicating that the variance was not large 
enough that the additional data should be encoded on the 4 th least significant bit of the 
25 average of the V values of the block, control passes to conditional branch point 333, 
which tests to determine if the average Y variance over the block, var(/?,g), is greater than 
a third prescribed V threshold, t3v, which is the smallest V threshold. An exemplary 
value of t3v is 7. 

If the test result in step 333 is YES, indicating that the variance is large enough 
30 that the data should be encoded on the 3 rd least significant bit of the average of the V 
values of the pixels of the block, control is passed to step 335, in which variable m is set 
equal to 3. 

If the test result in step 333 is NO, indicating that the variance is only large 
enough that the data should be encoded on the 2 nd least significant bit of the average 
35 value of the V value of the block control is passed to step 337, in which variable m is set 
equal to 2. 
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If the test result in step 307 is that the U is the chrominance portion is selected to 
be watermarked, control passes to step 311, in which the variable watermarkcolor is set 
equal to U. Thereafter, conditional branch point 343 tests to determine whether the 
average Y variance over the block, var(p,q), is greater than a first prescribed threshold 
5 tlu, which is the largest threshold. An exemplary value of tlu is 600. 

Instead of using the average Y variance over the block for the various 
comparisons, the average U variance over the block may be calculated and employed. 

If the test result in step 343 is YES, indicating that the variance is large enough 
that the data needs to be encoded on the 5 th least significant bit of the average of the U 
10 values of the pixels of the block, e.g., the value of int[sumV(p,q)/(number of pixels per 
block)], e.g., int[sumU(p,q)/64], is greater than tlu, control is passed to step 345, in 
which variable m is set equal to 5. 

Note that instead of using the integer function int for rounding herein, any other 
form of rounding to achieve an integer value may be employed, e.g., always rounding up 
1 5 or rounding to the nearest integer value. 

If the test result in step 343 is NO, indicating that the variance was not large 
enough that the data needed to be encoded on the 5 th least significant bit of the average of 
the U values of the pixels of the block, control passes to conditional branch point 349, 
which tests to determine if the average Y variance over the block, var(p ; ^), is greater than 
20 a second prescribed threshold t2u, which is the second largest U threshold. An exemplary 
value of t2u is 15. 

If the test result in step 349 is YES, indicating that the data needs to be encoded 
on the 4 th least significant bit of the average of the U values of the pixels of the block, 
control passes to step 351, in which variable m is set equal to 4. 

25 If the test result in step 349 is NO, indicating that the variance was not large 

enough that the data should be encoded on the 4 th least significant bit of the average of 
the U value of the pixels of the block, control passes to conditional branch point 353, 
which tests to determine if the average Y variance over the block, var(p,q), is greater than 
a third prescribed threshold t3u, which is the smallest U threshold. An exemplary value 

30 oft3uis7. 

If the test result in step 353 is YES, indicating that the variance is large enough 
that the data should be encoded on the 3 rd least significant bit of the average of the U 
values of the pixels of the block, control passes to step 355, in which variable m is set 
equal to 3. 

35 If the test result in step 353 is NO, indicating that the variance is only large 

enough that the data should be encoded on the 2 nd least significant bit of the average of 
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the U values of the pixels of the block, control is passed to step 357, in which variable m 
is set equal to 2. 

Once the particular bit of the average value over the block of the selected 
chrominance portion to be employed to contain the watermarked data is determined, the 
5 process to make certain that that bit position contains the desired bit is undertaken. The 
goal of the process is to add or subtract the minimum possible value from the current 
average value of the selected chrominance portion to make certain that the desired bit 
position has the value of the watermarking bit to be transmitted. The desired bit position 
may be a bit position within the integer portion of the average value. To this end, ideally, 

10 if the desired bit position already contains the value of the watermarking bit to be 
transmitted, nothing may be added to the current average value of the selected 
chrominance portion. On the other hand, if the desired bit position contains the 
complement of the value of the watermarking bit to be transmitted, ideally, only the 
smallest possible value that will flip the desired bit position to its complement by being 

15 either added to or subtracted from the desired bit position, and hence causing the least 
change in the value of the average value of the selected chrominance portion from its 
current unwatermarked value to its final watermarked value, is added to or subtracted 
from the desired bit position as appropriate. 

In practice, due to quantization noise, rounding as part of the inventive process, 

20 and other factors of the MPEG-like encoding process that may impact the final value of 
the desired bit, a slightly different value may be added or subtracted as explained further 
herein. More specifically, a "safe" range of values having the desired bit value at the 
desired bit position is selected, and the minimum value is either added or subtracted to 
the average value of the selected chrominance portion so that the final value has the 

25 desired bit value at the desired bit position and it is within the safe range. Thus, typically, 
whenever a bit of the average value needs to be changed to its complement to carry the 
watermark data, the resulting value is always at the border of a safe range. When the 
value at the desired bit position is already the value of the watermark data bit to be 
transmitted, if the average value of the selected chrominance portion is already within the 

30 safe range, then nothing needs to be added to the average value of the selected 
chrominance portion. However, when the average value of the selected chrominance 
portion is not already within the safe range, then the minimum value necessary to change 
the average value of the selected chrominance portion to be a value within the safe range, 
while keeping the value of the desired bit position at the value of the watermarking bit to 

35 be transmitted, is added to, or subtracted, from the average value of the selected 
chrominance portion. 
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Conceptually, the foregoing may be thought of as first adding or subtracting the 
minimum value to achieve the desired watermarking value at the desired bit position, and 
then adding or subtracting a further amount, e.g., a margin value, to insure that the final 
value is within the safe range. 
5 FIG. 5 shows an example of several safe ranges where the desired bit position is 

the third least significant bit. Along the axis are shown the average value of the selected 
chrominance portion 

Table 1 shows (code) (table of values) 

Upon completion of steps 325, 331, 335, 337, 345, 351, 355 and 357, control 

10 passes to conditional branch point 361, which tests to determine if the bit of 
watermarking data to be impressed on the block is the same as the current identified bit 
position for the average value of the chrominance portion identified by the variable 
watermarkcolor. If the test result in step 361 is YES, indicating that the bit of 
watermarking data to be impressed on the block is the same as the current identified bit 

15 position for the average value of the chrominance portion identified by the variable 
watermarkcolor, and that therefore the bit does not need to be changed to its 
complementary value, control passes to step 363, which tests to determine if the value is 
within the safe range for the current bit position. If the test result is NO, indicating that 
an error might be introduced during subsequent processing, control passes to step 365, 

20 which sets the variable changevalue to be equal to the value needed to move the current 
average value for the color indicated by watermarkcolor into the nearest safe range 
without changing the value of the desired bit position. Note that the value need not be an 
integer value, and it may also be a negative value. If the test result in step 363 is NO, 
indicating that the current average value for the color indicated by watermarkcolor is 

25 already within a safe range, control passes to step 367, and the value of changevalue is set 
equal to zero. 

If the test result in step 361 is NO, indicating that the bit of watermarking data to 
be impressed on the block is not the same as the current identified bit position for the 
average value of the chrominance portion identified by the variable watermarkcolor, and 

30 that therefore the value of the bit must be changed to its complementary value so as to 
properly carry the watermarking data, control passes to step 369, which tests to determine 
if the nearest safe range for the current bit position is greater or smaller than the current 
average value of the color indicated by watermarkcolor. If the test result in step 369 is 
GREATER, indicating that the values of the nearest safe range for the current bit position 

35 is greater than the current average value of the color indicated by watermarkcolor, control 
passes to step 371 in which the value of variable changevalue is set equal to the smallest 
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value to add to the average value so that the resulting value is within the adjacent safe 
range with bigger values. Note that this value need not be an integer value. If the test 
result in step 369 is SMALLER, indicating that the values of the nearest safe range for the 
current bit position is smaller than the current average value of the color indicated by 
5 watermarkcolor, control passes to step 373 in which the value of variable changevalue is 
set equal to the smallest negative value that when added to the average value results in a 
value that is within the adjacent safe range with smaller values. Again, note that this 
value need not be an integer value, and it may also be a negative value. 

Upon conclusion of step 365, 367, 371, or 373, control passes to step 375 in 

10 which the total to add to the pixels is set equal to the product of the number of pixels per 
block and the value of changevalue. If the resulting product value is not an integer, the 
value is rounded off The rounding may be performed in a manner consistent with the 
steps 365, 371, and 373, in that if a negative value was added, the rounding is down by 
taking the integer portion of the value, while if a positive value was added the rounding is 

1 5 up toward the next whole integer value. 

Processing now changes from a per-block level to a per-pixel level within the 
block. In step 377, the first pixel of the block is pointed to. Thereafter, conditional 
branch point 379 tests to determine if the current pixel is to be watermarked, based on its 
color. This is done by determining if the chrominance component of this pixel that is 

20 suitable for watermarking is the same as the color selected in step 307 for the entire 
block. If the test result in step 379 is YES, indicating that this pixel should be 
watermarked, control passes to step 381, in which a value is added to the current pixel 
based on the luminance variance for the pixel and the total values added so far to the 
pixels of the block. 

25 More specifically, a maximum value that can be added to the pixel without 

introducing a visible artifact is determined as a function of the variance of the luminance. 
The greater the variance of the luminance, the greater the value that can be added, up to a 
prescribed maximum. Note that this value may be positive or negative. This value is 
then added to pixel if the total to be added to the pixels is a positive value, or the value is 

30 subtracted from the pixel if the total to be added to the pixels is a negative value. 
However, as the per-pixel processing proceeds running total of the values added or 
subtracted are subtracted from the total to be added to the pixels. If the value to be added 
to the current pixel will make the difference between the total to be added to the pixels 
and the running total cross zero, then the value is adjusted so that the running total just 

35 equals zero. 
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If the test result in step 379 is NO, or after completing step 381, control passes to 
conditional branch point 383, which tests to determine if the current pixel is the last pixel 
of the block. If the test result in step 383 is NO, control passes to step 385 which tests to 
determine if the total to be added to the pixels of the block has already been added, i.e., is 
5 the running total equal to the total to be added to the pixels of the block. If the test result 
in step 385 is NO, indicating that there is more that needs to be added to the pixels of the 
block, control passes to step 387, which points to the next pixel of the block. Control 
then passes back to step 379, and the process continues as described above. 

If the test result in either of steps 383 or 385 is YES, indicating that either all the 

10 pixels of the block have been processed or all of the total that need to be added has been 
added, control passes to step 389 and the process is exited. 

FIGs. 4A and 4B, when connected together as shown in FIG. 4, show an 
exemplary process for extracting the additional information from a digitally watermarked 
video signal in which the additional information that constitutes the watermarking signal 

15 within the video signal has been impressed upon the chrominance component. Such a 
process may be implemented by a system such as the one shown in FIG. 2, across color 
selection 207, double pole double throw switch 209, block variance calculation 211, 
block integrator V 213, block integrator U 215 and bit selection 217 (FIG. 2). 

The process is entered in step 401 (FIG. 4) when a new block of the received 

20 decimated frame is to be processed. Note that for pedagogical purposes it is assumed 
herein that pixels are supplied for processing by the process of FIG. 4 grouped by block, 
so that all the pixels of a block are processed prior to any pixels of the next block being 
processed. However, in designing an actual system, those of ordinary skill in the art will 
readily recognize that the pixels may be processed in the same order that they are scanned 

25 and that appropriate memory locations and control structures may be used so as to 
effectively separately process the blocks. 

Part of the processing of FIG. 4 takes place on a block-by-block basis, and part on 
a pixel-by-pixel basis. The blocks of a frame are indexed using a two-dimensional 
pointer p,q, where p points to the particular horizontal slice of the frame that is being 

30 processed and q points to the particular column, or vertical slice, of the frame. For 
example, for 720x480 resolution, p ranges between 1 and 30 and q between 1 and 45. 
Similarly, the pixels of each block are indexed using a two-dimensional pointer ij\ where 
i points to the particular row within the block that is being processed and j points to the 
particular column within the block that is being processed. For example, in the special 

35 processing mode employed to impress the data, where each macroblock of original video 
has only a corresponding 8x8 block of Y, U, and V, both i and j range between 0 and 7. 
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After entering the process in step 401, several variables that are used in the 
process are initialized in step 403, e.g., countU(##)=0, countV(/?,#)=0, sumU(/?,#)=0, 
sumV(/?,#)=0, and var(p,q)=0. CountU and countV are a running total of how many 
pixels within the block were selected by the color selection process as being U and V, 
5 respectively, while sumU and sumV are the running total values of U and V, respectively, 
over all the pixels of the block. For the block, i and j are both set to point to the first 
pixel of the block to be processed, e.g., i=0 and y=0 as well For each block, var(p,q) 
represents the total of the variance of the luminance for each individual pixel within the 
block, which is, of course, proportional to the average variance of the luminance for the 
10 block. 

Thereafter, in step 405, the Y, U and V values for the currently pointed to pixel of 
the currently being processed block is obtained, e.g., the values of U { J$ and 

Vyj)* are obtained. The current values of U and V are added to the respective current 

values of sumU and sumV in step 407. Also in step 407 the variance of the luminance, 
15 var(p,q), is updated by adding the variance of the luminance for the current pixel to the 
current total of var(p,#). The variance of the luminance for the current pixel may 
computed by taking the absolute value of the difference in the luminance between the 
current pixel and all of its nearest neighbors. Mathematically, where all of the nearest 
neighbors are within the same block this may be written as 
20 varfo^varfo q)+(\ F ( <™> - Yft% | + | Yfcf - |+| - ]j£> | + | Yft? - Y^f |+| Y™ - 

y(p,<i) j . I y{p,q) y(p,q) i,| y(p,<f) yip,q) hi y(p,q) y(p^) i, i y(p,v) y(/v/) { \ 
I (i+\J+\)r\ I (iJ) ■ / (/+i,7)H / (/J) ' I {iJ^)^\ I (iJ) (/,./) 

Those of ordinary skill in the art will readily be able to adapt the foregoing to 
those pixels whose nearest neighbors are in other blocks. Furthermore, for those blocks 
that are near the borders of the frame, and hence have no nearest neighbors, or the nearest 
25 neighbors are part of those blocks that are not displayed, the value of such neighbors may 
be considered to be zero. 

Not all of the nearest neighbors need be considered and yet sufficiently high 
quality results can be achieved. More specifically, it is advantageous in that computation 
time is reduced to take the differences of the 4 pixels in the corners of the rectangle 
30 surrounding the current pixel and 2 of the other pixels that form a vertical or horizontal 
line with the current pixel, e.g., the 2 pixels on the horizontal line with the current pixel. 
However, the decoder should match the same process that was employed in the encoder. 

Control passes to conditional branch point 409, which tests to determine on which 
of U or V it was likely that the additional data was impressed. The details of this 
35 determination will be described further hereinbelow. If the test result in step 409 is U, 
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indicating that the additional data was most likely impressed on U for the current pixel, 
control passes to step 41 1, in which countU is incremented. Control then passes to step 
413. If the test result in step 409 is V, indicating that the additional data was most likely 
impressed on V for the current pixel, control passes to step 415, in which countV is 
5 incremented. Control then passes to step 413. 

Alternatively, conditional branch point 409 may be a three-way test, with an 
additional result indicating that it is likely that data was not impressed on the pixel at all, 
i.e., not on the U or the V. If such is the result, control simply passes directly to step 413. 
Conditional branch point 413 tests to determine if the current pixel is the last pixel 

10 of the current block. If the test result in step 413 is NO, indicating that there remains 
additional pixels in the current block that have yet to be processed, control passes to step 
417, in which the values of / and j are adjusted to point to the next as-of-yet-not- 
processed pixel. Control then passes back to step 405 and the process continues as 
described above. If the test result in step 413 is YES, indicating that all the pixels of the 

15 current block have been processed, control passes to step 419, in which the variance of 
the decimated luminance for the block is calculated, i.e., the variance of the 8x8 Y block 
is calculated. 

Control then passes to conditional branch point 421, which tests to determine if 
count V>countU for the current block. If the test result in step 421 is that countV is 
20 indeed greater than countU, control passes to conditional branch point 423, which tests to 
determine whether the average Y variance over the block, var(p,<7), is greater than a first 
prescribed threshold tlv, which is the largest V threshold. An exemplary value of tlv is 
600. 

Alternatively, instead of using the average Y variance over the block for the 
25 various comparisons, the average U or the average V variance over the block may be 
calculated and employed, e.g., whichever has the greater count value. 

If the test result in step 423 is YES, indicating that the variance is large enough 
that the data was likely to have been encoded on the 5 th least significant bit of the integer 
portion of the average of the V values of the pixels of the block, e.g., the value of 
30 int[sumV(p,q)/(number of pixels per block)], e.g., int[sumV(p,q)/64], control is passed to 
step 425, in which a variable m is set equal to 5. Control then passes to step 427, in 
which the value of the m ih least significant bit of the average of the V values of the pixels 
of the block is extracted as the value impressed upon this block. The process is then 
exited in step 459. 
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Note that instead of using the integer function int for rounding herein, any other 
form of rounding to achieve an integer value may be employed, e.g., always rounding up 
or rounding to the nearest integer value. 

If the test result in step 423 is NO, indicating that the variance was not large 
5 enough that the data was likely to have been encoded on the 5 th least significant bit of the 
integer portion of the average of the V values of the pixels of the block, control passes to 
conditional branch point 429, which tests to determine if the average Y variance over the 
block, var(p,q), is greater than a second prescribed threshold t2v, which is the second 
largest V threshold. An exemplary value of t2v is 15. 

10 If the test result in step 429 is YES, indicating that the variance is large enough 

that the data was likely to have been encoded on the 4 th least significant bit of the integer 
portion of the average of the V values of the pixels of the block, control is passed to step 
431, in which variable m is set equal to 4. Control then passes to step 427, in which the 
value of the m th least significant bit of the average of the V values of the pixels of the 

15 block is extracted as the value impressed upon this block. The process is then exited in 
step 459. 

If the test result in step 429 is NO, indicating that the variance was not large 
enough that the data was likely to have been encoded on the 4 th least significant bit of the 
integer portion of the average of the V values of the pixels of the block, control passes to 

20 conditional branch point 433, which tests to determine if the average Y variance over the 
block, var(p,q), is greater than a third prescribed threshold t3v, which is the smallest V 
threshold. An exemplary value of t3v is 7. 

If the test result in step 433 is YES, indicating that the variance is large enough 
that the data was likely to have been encoded on the 3 rd least significant bit of the integer 

25 portion of the average of the V values of the pixels of the block, control is passed to step 
435, in which variable m is set equal to 3. Control then passes to step 427, in which the 
value of the m th least significant bit of the average of the V values over the pixels of the 
block is extracted as the value impressed upon this block. The process is then exited in 
step 459. 

30 If the test result in step 433 is NO, indicating that the variance is only large 

enough that the data was likely to have been encoded on the 2 nd least significant bit of the 
integer portion of the average of the V values of the pixels of the block, control is passed 
to step 437, in which variable m is set equal to 2. Control then passes to step 427, in 
which the value of the m th least significant bit of the average of the V values of the pixels 

35 of the block is extracted as the value impressed upon this block. The process is then 
exited in step 459. 
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If the test result in step 421 is that countU is greater than countV, control passes to 
conditional branch point 445, which tests to determine whether the average Y variance 
over the block, var(p,q), is greater than a first prescribed threshold tlu, which is the 
largest U threshold. An exemplary value of tlu is 600. 
5 If the test result in step 445 is YES, indicating that the variance is large enough 

that the data was likely to have been encoded on the 5 th least significant bit of the integer 
portion of the average of the U values of the block, e.g., the value of 
int[sumU(p,q)/(number of pixels per block)], e.g., int[sumU(p,q)/64], control is passed to 
step 445, in which variable m is set equal to 5. Control then passes to step 447, in which 
10 the value of the m ih least significant bit of the average of the U values over the pixels of 
the block is extracted as the value impressed upon this block. The process is then exited 
in step 459. 

If the test result in step 445 is NO, indicating that the variance was not large 
enough that the data was likely to have been encoded on the 5 th least significant bit of the 

15 integer portion of the average of the U values of the pixels of the block, control passes to 
conditional branch point 449, which tests to determine if the average Y variance over the 
block, var(p,q), is greater than a second prescribed threshold t2u, which is the second 
largest U threshold. An exemplary value of t2u is 1 5. 

If the test result in step 449 is YES, indicating that the variance is large enough 

20 that the data was likely to have been encoded on the 4 th least significant bit of the integer 
portion of the average of the U values of the block, control is passed to step 451, in which 
a variable m is set equal to 4. Control then passes to step 447, in which the value of the 
m ih least significant bit of the average of the U values of the pixels of the block is 
extracted as the value impressed upon this block. The process is then exited in step 459. 

25 If the test result in step 449 is NO, indicating that the variance was not large 

enough that the data was likely to have been encoded on the 4 lh least significant bit of the 
integer portion of the average of the U values of the pixels of the block, control passes to 
conditional branch point 453, which tests to determine if the average Y variance over the 
block, var(p,q\ is greater than a third prescribed threshold t3u, which is the smallest U 

30 threshold. An exemplary value of t3u is 7. 

If the test result in step 453 is YES, indicating that the variance is large enough 
that the data was likely to have been encoded on the 3 rd least significant bit of the integer 
portion of the average of the U values of the pixels of the block, control is passed to step 
455, in which variable m is set equal to 3. Control then passes to step 447, in which the 

35 value of the m ih least significant bit of the average of the U values of the pixels of the 
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block is extracted as the value impressed upon this block. The process is then exited in 
step 459. 

If the test result in step 453 is NO, indicating that the variance is only large 
enough that the data was likely to have been encoded on the 2 nd least significant bit of the 
5 integer portion of the average of the U values of the pixels of the block, control is passed 
to step 457, in which variable m is set equal to 2. Control then passes to step 447, in 
which the value of the m th least significant bit of the average value of the U values of the 
pixels of the block is extracted as the value impressed upon this block. The process is 
then exited in step 459. 

10 Note that although the use of 3 thresholds and 4 bit positions has been shown in 

FIGs. 3 and 4, those of ordinary skill in the art will readily be able to adapt the indicated 
method to other numbers of thresholds and encoded values. 

Similarly, not all blocks of each frame or field of the video signal need be 
impressed with additional information. 

15 FIG. 6 shows an exemplary process for determining which particular chrominance 

portion is more suitable, and so should be selected, to contain the watermarking 
information for a pixel. The process is entered in step 601 when it is necessary to select a 
chrominance portion to contain watermarking information. For purposes of discussion of 
FIG. 6, it is assumed that the pixel is represented in YUV format. Furthermore, it is 

20 noted that, preferably, for each original 2x2 luminance block of original video, had the 
original video been in 4-4-4 representation, there should only one Y value for each 
chrominance component, i.e., each pair of respective corresponding U and V values. To 
this end, the Y values of the original block may be downsampled so as to have the same 
resolution as the U and V. Alternatively, the average, or some other combination, of the 

25 Y values associated with a particular U and V values may be computed and used as the Y 
value for the process of FIG. 6. 

Conceptually, each position in a three-dimensional YUV colorspace 
corresponding to a possible pixel position, given the full range that a pixel's Y, U, and V 
values can take, is assigned a chrominance portion, e.g., based on experimental 

30 observations, that is more suitable, and so should be selected, for a pixel having such Y, 
U, and V values. If a version of the entire table for each possible set of Y, U, and V 
values was to be employed, where each of Y, U, and V has a full range of 8 bits, at least 
16M bit of information would need to be stored, assuming that only one bit was stored for 
each position to indicate the selected chrominance portion. Note that use of a single bit 

35 only permits selection of U or V, but not a designation that neither U nor V should be 
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employed. If it were desired to be able to select neither U nor V, 32 Mbits of information 
would be necessary. 

A cutaway view of a portion of exemplary assignments of a chrominance portion 
that is to be selected for each possible pixel within a three-dimensional YUV colorspace 
5 is shown in FIG. 7. Note that FIG. 7 is provided for pedagogical purposes only, as a 
conceptualization visual aid, and does not represent actual data. 

In order to reduce the storage requirements, the YUV colorspace may be 
considered to be a group of regions, each region being defined to include the positions 
corresponding to at least one set, and typically multiple sets, of Y, U, and V values, i.e., 

10 the positions in the colorspace corresponding to at least one pixel, and possibly many 
pixels, and each region, and hence each pixel which maps to that region, is assigned a 
chrominance portion, e.g., based on experimental observations, that is to be selected for 
any pixel whose set of Y, U, and V values fall within the region. One way to look at such 
grouping into regions is a quantization, which may be linear or nonlinear. 

15 Table 1 is a listing for an exemplary colorspace selection table, where each region 

corresponds to 4 Y values, 4 U values, and 4 V values, and hence to 64 possible 
combinations of 8 bit values for any pixel. Using such a table reduces the required 
information to be stored down to 256 Kbits, assuming that only one bit was stored for 
each position, or 512 Kbits, assuming it were desired to be able to select Y, V, and 

20 neither U nor V. Table 1 may be stored in any computer readable medium, e.g., ROM, 
RAM, magnetic storage such as a hard disk or tape drive, optical storage such as a CD- 
ROM or DVD-ROM, or the like. 

Those of ordinary skill in the art will readily recognize that the values employed 
in Table 1, which are for each of Y, U, and V having a full range of 8 bits, may be scaled 

25 for use with 10 bit Y, U, and V values by dividing by 4, e.g., shifting each 10 bit value to 
the right two times. Likewise, other numbers of bits used for Y, U, and V can be 
similarly accommodated. 

In order to effectively arrange and access the data of Table 1, it is arranged so that 
the specified U or V selection, where 1 indicates select U and 0 indicates select V, for 8 

30 adjacent regions having the same U and V quantized values but different sequential 
quantized Y values, are grouped together to form a byte. Thus, for each U and V value 
there are 8 bytes, each corresponding to a region having the same U and V quantized 
values but different quantized Y values. 

Table 1 is arranged to be addressed using an address that has the most significant 

35 bits corresponding to the U values, the next least significant values corresponding to the 
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V values, and the least significant values corresponding to the Y values. In other words, 
the address of the bytes may be formed as follows: 

U7|U6|U5|U4|U3|U2|V7|V6|V5|V4|V3|V2|Y7|Y6|Y5 

where U7,U6,U5,U4,U3, and U2 are the values of the 8 th to 3 rd least significant 
5 bits of the pixels U value, V7,V6,V5,V4,V3, and V2 are the values of the 8 th to 3 rd least 
significant bits of the pixels V value, and Y7,Y6, and Y5, are the values of the 8 th to 6 th 
least significant bits of the pixels Y value. Then, the particular bit within the byte is 
specified by using the 5 th to 2 nd least significant bits of the Y component, e.g., Y4,Y3, and 
Y2. 

10 A table such as Table 1 is reflective of the facts that the human visual system is 

a) less sensitive to the blue color and b) more sensitive to lower luminance values. Such 
a table may be developed by trial and error, generally as follows. 

The colorspace is examined in sections, each section being defined by a 
luminance value and ranging in a first dimension corresponding to a first chrominance 

15 portion changing from its minimum value to its maximum value and in the second 
dimension corresponding to the second chrominance portion changing from its minimum 
value to its maximum value. Any or all of the luminance and the chrominance portions 
may be quantized, e.g., using the 6 most significant bits of 8 bit values. Doing so creates 
a set of planes having a checkerboard of chrominance portion values, which appears when 

20 displayed as blocks of different colors, one plane for each luminance value. For example, 
quantizing so as to use the 6 most significant bits of 8 bit values for the luminance and 
both chrominance portions yields 64 planes that correspond to each possible quantized 
luminance value, and each plane has a checkerboard pattern of colored boxes, with 64 
boxes vertically and 64 boxes horizontally for a total of 4096 boxes per plane. 

25 Each plane is examined separately. Random data is developed for a number of 

frames sufficient to be confident that the random data will have different values in like 
positioned blocks of the frame over time and for an observer to detect flicker should it 
appear. Thirty seconds or longer have proven to be of value. The random data is 
impressed upon frames that contain the plane, but only on a first one of the chrominance 

30 portions, e.g., using the system of FIG. 1 and the process of FIG. 3 to accomplish the 
watermarking but forcing the color selection to be the first chrominance portion. The 
resulting watermarked version of the frames is displayed and observed. 

Any block for which no flicker is observed is indicated in the table that its 
combination of luminance and chrominance portions should employ the chrominance 

35 portion currently carrying the watermark data as the selected chrominance portion for that 
combination. Any block for which flicker is observed is indicated in the table that its 
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combination of luminance and chrominance portions should employ the chrominance 
portion that is not currently carrying the watermark data as the selected chrominance 
portion for that combination. The process is repeated for the plane but changing the 
chrominance portion that is watermarked. 
5 For any block of a plane that flicker occurs for both chrominance portions, as can 

happen, the implementer may choose which chrominance portion should be selected. For 
example, U may be chosen because the human visual system is generally less sensitive to 
blue. Alternatively, the chrominance portion that would provide for better data 
compression of the resulting table may be employed. Similarly, where flicker does not 
10 appear on either block, the choice of the chrominance portion to employ is at the 
discretion of the implementer. 

The process is repeated for each plane until the entire table is populated. 
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Step 603 begins the process of accessing the information when so arranged. More 
25 specifically, in step 603, 

V (P>1) _ y( M ) r 
'(id) J (U) >>D 

U 0d) ~ U 0J) >>Z 

and 

V VJ) ~ V {iJ) >>z 

30 are calculated, 

wherein, similar to that described hereinabove, 

p points to the particular horizontal slice of the frame is being processed and q 
points to the particular column, or vertical slice, of the frame, i points to the particular 
row within the block that is being processed, j points to the particular column within the 
35 block that is being processed, and "»" is a right shift operation. Doing so leaves only 
the desired 8 th to 3 rd least significant bits of the pixels U value, the 8 th to 3 rd least 
significant bits of the pixels V value, and the 8 th to 6 th least significant bits of the pixels Y 
value. Thereafter, in step 605 the lookup table address for the current pixel is calculated 
as 

40 Z0T_A/re5^ where "«" is a left-shift 

operation. 

Doing so combines the extracted bits into a combined address and points to the 
one byte that corresponds to the pixel. Thereafter, in step 607, the particular bit within 
the byte that corresponds to the pixel is determined, by using the value made up of the 2 nd 
45 to 5 th least significant bits of the Y component as an index into the byte. To this end, step 
607 calculates 

6=mod(^f «2,8) 

where mod is the modulo function. 
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In step 609, the value of the 6 th bit position of the byte at the calculated lookup 
table address is extracted, assigned as the value of a variable m, which is supplied as an 
output. Again, in this exemplary embodiment, if the extracted bit is a 1, U is the selected 
chrominance portion while if the extracted bit is a 0, V is the selected chrominance 
portion. 

The process then exits in step 611. 

Those of ordinary skill in the art will readily recognize how to adapt the foregoing 
to pixels in other formats, e.g., RGB or YIQ, 

Note that if Huffman encoding of the table is desired, it may be advantageous that 
the forgoing correspondence of select U being a 1 and select V being a zero should be 
reversed, assuming, as has been seen experimentally, that U is selected for a majority of 
pixel combinations. 

FIG. 8 shows another exemplary process by which the particular chrominance 
portion is selected to contain the watermarking information for a pixel. The process is 
entered in step 801 when it is necessary to select a chrominance portion suitable to 
contain watermarking information. As in FIG. 6, for purposes of discussion of FIG. 8, it 
is assumed that the pixel is represented in YUV format. Furthermore, it is noted that, 
preferably, for each original 2x2 luminance block of original video, had the original video 
been in 4-4-4 representation, there should only one Y value for each chrominance 
component, i.e., each pair of respective corresponding U and V values. To this end, the Y 
values of the original block may be downsampled so as to have the same resolution as the 
U and V. Alternatively, the average, or some other combination, of the Y values 
associated with a particular U and V values may be computed and used as the Y value for 
the process of FIG. 8. 

In order to further reduce the storage requirements in the embodiment of FIG. 8, 
as compared to the embodiment of FIG. 6, not only is the YUV colorspace divided into 
regions, each region including positions corresponding to at least one set of Y, U, and V 
values, with each region being assigned a chrominance portion, e.g., based on 
experimental observations, that is to be selected for any pixel whose Y, U, and V values 
fall within the region, as described in connection with FIG. 6, but any pixel that has a U 
value less than a predefined value, e.g., one-half the maximum value, has the U 
chrominance portion selected for watermarking. Thus, for 8 bit Y 5 U, and V, values, if 
the value of U is less than 128, the U chrominance portion is always selected for 
watermarking regardless of the values of V or Y. This is because human visual system is 
less sensitive to the blue component U than the V component. 
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By having the most significant address bits of the chrominance portion selection 
table correspond to the U-value-derived bits of the address, advantageously, the size of 
the table can be reduced by up to one half. This is achieved by adding a test to determine 
if the U value is less than one half the maximum value prior to forming the table address, 
5 and if the test result is YES, simply indicating to select the U chrominance portion and 
skipping the rest of the process of accessing the table, and also by subtracting one half the 
maximum U value from the actual U value prior to calculating the U-value-derived bits of 
the address. Thus, the section of the table employed for FIG. 6 corresponding to the most 
significant U bit being 0 is eliminated, and only that portion of the table where the most 
10 significant U bit is 1 is retained. However, the indexing into the remaining portion of the 
table is shifted by the subtraction from the U value of the one half of the maximum U 
value prior to forming the U-value-derived bits. 

Thus, the table is arranged to be addressed using an address that has the most 
significant bits corresponding to the U values, the next least significant values 
15 corresponding to the V values and the least significant values corresponding to the Y 
values. In other words, the address of the bytes may be formed as follows: 
U6|U5|U4|U3|U2|V7|V6|V5|V4|V3|V2|Y7|Y6|Y5 

where U6,U5,U4,U3, and U2 are the values of the 7 th to 3 rd least significant bits of 
the pixels U value, V7,V6,V5,V4,V3, and V2 are the values of the 8 th to 3 rd least 
20 significant bits of the pixels V value, and Y7, Y6, and Y5, are the values of the 8 th to 6 th 
least significant bits of the pixels Y value. Then, the particular bit within the byte is 
specified by using the 5 th to 2 nd least significant bits of the Y component, e.g., Y4, Y3, 
and Y2. 

To this end, conditional branch point 802 tests to determine if 
25 U^fJ < predefined _value , where predefined _value is, for example, one half the 

maximum U value. Note that to save a bit, and half the table size, preferably 
predefined_yalue should be a power of 2. If the test result in step 802 is NO, indicating 
that the value of U is less than the predefined value, e.g., one half the maximum value of 
U, e.g., 128, and hence the chrominance portion to be selected will be a function of Y, U, 
30 and V, and so the table must be accessed, control passes to step 803 to begin the process 
of accessing the table. In step 803, 

u\ff = (U[$ - predefined value) » 2 , e.g., u[f$ = (Ufcf - 1 28) » 2 
and 

^ V (U) V {U) >>Z 
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are calculated, 

where, similar to that described hereinabove, 

where p points to the particular horizontal slice of the frame is being processed 
and q points to the particular column, or vertical slice, of the frame, i points to the 
particular row within the block that is being processed, / points to the particular column 
within the block that is being processed, and "»" is a right-shift operation. Doing so 
leaves only the desired 7 th to 3 rd least significant bits of the pixels U value, the 8 th to 3 rd 
least significant bits of the pixels V value, and the 8 th to 6 th least significant bits of the 
pixels Y value. Thereafter, in step 805 the lookup table address for the current pixel is 
calculated as 

LUT _ Address™ = u<$ « 9 + vft? « 3 4- , where "«" is a left- 
shift operation. 

Doing so combines the extracted bits into a combined address and points to the 
one byte that corresponds to the pixel. Thereafter, in step 807, the particular bit within 
the byte that corresponds to the pixel is determined, by using the value made up of the 5 th 
to 2 nd least significant bits of the Y component as an index into the byte. To this end, step 
807 calculates 

6=mod(^gf «2,8) 

where mod is the modulo function. 

In step 809, the value of the 6 th bit position of the byte at the calculated lookup 
table address is extracted and stored in the variable m. The value of variable m is 
supplied as an output in step 811. Again, if the output bit is a 1, U is the selected 
chrominance portion while if the extracted bit is a 0, V is the selected chrominance 
portion. The process then exits in step 813. 

If the test result in step 802 is YES, indicating that the U chrominance portion 
should be selected, because the pixel color is not primarily blue and hence changing the 
blue color of the pixel will not be detected by the human visual system, control passes to 
step 815, in which the variable m is set equal to 1. Doing so assures that U is selected. 
Control then passes to step 811, and the process continues as described above. 

Notwithstanding the foregoing improvements in color selection, with certain Y, 
U, and V values for a pixel, there is still, disadvantageously, a possibility that a slightly 
detectable flickering be manifest. This is because in order to survive MPEG-like 
encoding there may be a need to add large values to the average value of the selected 
chrominance portion. 
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FIG. 9 shows an exemplary transmitter in which the flickering may be reduced, in 
accordance with the principles of the invention, by replicating the data to be impressed, at 
least once, and preferably two or more times, prior to its being impressed upon the 
average value of a chrominance portion of a block. The original and each replica are 
5 transmitted in the same block position of separate consecutive frames. Preferably, the 
frames having like-positioned blocks carrying the same data are consecutive in display 
order. Furthermore, in accordance with an aspect of the invention, specific blocks of the 
frame may be embedded with a particular known data sequence, e.g., a Barker sequence, 
rather than encoded user data. 

10 The embodiment of the invention in FIG. 9 is similar to the arrangement of FIG. 

1. All like-numbered elements of FIG. 9 operate substantially the same as in FIG. 1. In 
addition to those elements of FIG. 1 that are shown in FIG. 9 are repeater 925 and 
optional sequence adder 927. In addition, bit mapper 123 of FIG. 1 is optionally replaced 
in FIG. 9 by bit mapper 923. Replacement of bit mapper 123 by bit mapper 923 is 

15 necessary only if the additional functionality described hereinbelow in connection with bit 
mapper 923 is desired. 

Repeater 925 receives bits either from block interleaver 121 or optional sequence 
adder 927. Repeater 925 stores the received bits and outputs them for like-positioned 
blocks of at least two frames. In one embodiment of the invention, it has been found that 

20 good results are achieved when repeater 925 stores the received bits and outputs them for 
the like-positioned blocks of three frames. Those of ordinary skill in the art will be able 
to trade-off any perceived flicker with desired throughput of the watermark data by 
choosing the number frames for which the data is repeated. 

Optional sequence adder 927 embeds a particular known data sequence, e.g., a 

25 Barker sequence, in specific blocks of the frame, the data sequence being in lieu of 
encoded user data. In accordance with an aspect of the invention, the specific blocks in 
which the data sequence is encoded may be scattered throughout the blocks of a frame. 
Each group of initial and repeated data frames may employ a different known sequence. 
Doing so will enable the receiver to detect the grouping of the frames. Alternatively, the 

30 same sequence may be employed for each group but the specific blocks used for the 
sequence may be different for consecutive groups. 

FIG. 10 shows an exemplary embodiment of a receiver for use in receiving a 
watermarked video signal, such as that produced by the transmitter of FIG. 9, in 
accordance with the principles of the invention. The embodiment of the invention in FIG. 

35 10 is similar to the arrangement of FIG. 2. All like-numbered elements of FIG. 10 
operate substantially the same as they do in FIG. 2. In addition to those elements of FIG. 
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2 there are shown in FIG. 10 sequence processor 1025 and frame weighting unit 1027. 
Furthermore, channel decoder 221 of FIG. 2 is optionally replaced in FIG. 10 by channel 
decoder 1021. 

A receiver, e.g., as shown in FIG. 10, may detect group synchronization using 
sequence processor 1025. This may be performed by adding up the values of the group 
identification sequence from each frame of a group-length-number of consecutive frames, 
which thus are employed as a synchronization pattern, and determining if the result 
exceeds a prescribed threshold. If the threshold is exceeded, it is assumed that the first 
frame whose expected synchronization pattern value was added is the first frame in the 
group. If the threshold is not exceeded, it is assumed that the first frame whose value was 
added is not the first frame of a group. This is analogous to performing an 
autocorrelation on the synchronization pattern. Those of ordinary skill in the art will 
recognize that other conventional techniques for avoiding false matches, as well as 
handling missing the first frame due to errors, such as searching for a maximum prior to 
declaring group synchronization, may be employed. 

Advantageously, once the receiver detects the regular group pattern, any time 
there is a deviation from the pattern the receiver will be able to recognize that a frame of 
the original video sequence has been removed. Such information may be supplied as an 
output by sequence processor 1025. 

For example, in one embodiment of the invention, various commercials of a 
vendor within a video signal may be monitored. The vendor may be assigned a unique 
code that is embedded in each frame of its commercial. A receiver is made aware of the 
particular unique code and which blocks of the watermarked frames should contain the 
code. By detecting the appearance of the code within watermarked frames, the receiver 
can identify a frame as being one that belongs to one of the commercials of the vendor. 
Once a frame with the code is detected, the number of sequential frames incorporating the 
code can be counted to determine the length of the commercial. If the number of frames 
counted is less than the anticipated number of frames based on the known length of the 
commercial when it was originally watermarked, it may be assumed that the commercial 
was inappropriately shortened by removing the number of frames that corresponds to the 
difference between the anticipated number of frames and the counted number of frames. 
Those of ordinary skill in the art will recognize that other conventional techniques for 
avoiding false matches, as well as handling missing the first frame due to errors, may be 
employed. 

Each frame of the commercial, or groups of frames within the commercial, may 
be watermarked with a unique identifier, e.g., a frame or group number, which is part of a 
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distinct sequence over the frame. When a gap in the expected sequence is detected due to 
one or more missing frames, the missing frames may be specifically identified when each 
frame has a unique identifier. When identifiers are assigned only to groups and the 
number of frames in each group is known, only the particular group to which any missing 
frames belongs may be identified, along with the count of how many frames are missing. 

Although replication of the data may be employed to reduce flicker, as indicated 
hereinabove in accordance with the principles of the invention, doing so may limit the 
ability to detect missing frames to merely identifying the group from which the frame is 
missing, rather than being able to identify the particular frame. Therefore, in accordance 
with an aspect of the invention, although the watermark data is generally replicated, at 
least an individual frame identifier may not be replicated. The blocks containing such 
non-replicated frames are placed where they will be least likely to attract attention should 
they cause flickering, e.g., the corners of the frame. Doing so provides the majority of the 
benefit of reducing detectable flicker, while also allowing particular individual frames 
that are missing to be detected. 

If a vendor has different commercials, each of the commercials may have a further 
sequence embedded in at least one of its frames to identify the particular commercial of 
that vendor that is being received. 

Should multiple vendors have watermarked commercials, so long as each vendor 
is assigned a unique code, a system monitoring for the appearance of the commercials of 
a first vendor with a first unique code will ignore commercials of a second vendor with a 
second unique code. Alternatively, a single system may monitor a video signal for the 
appearance of commercials from different vendors that each have a unique code, and the 
results may be segregated by vendor based on their codes. 

In another embodiment of the invention in which multiple vendors have 
watermarked commercials, each vendor employs the same code, and the code may even 
be at the same block locations within the frame for each vendor. However, all the 
subsequent data contained within the frame is encrypted using a unique key for each 
vendor and each vendor has a receiver that knows only the key for that vendor. 
Therefore, each vendor can only decrypt and receive data from its own commercials. In 
another embodiment of the invention, the data for each vendor may be encrypted by 
scrambling the data over the blocks of a frame. Each receiver would know only the 
scrambling pattern for its associated vendor. 

Monitoring for an initial appearance of a code indicating the start of a commercial 
may be performed continuously, or within a window of time during which the 
commercial is expected to be broadcast. 
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In accordance with an aspect of the invention, instead of simply repeating the data 
over the multiple frames of a group and then using bit mapper 123 (FIG. 1), the amount 
added to the average value of a chrominance portion of a block, which depends on the 
complexity of the block and its anticipated quantization level, may be changed slightly 
from frame to frame over a group, even when the complexity of the block is the same at 
corresponding locations from frame to frame. In accordance with an aspect of the 
invention, the change that is made is small with respect to the value being added to the 
average to place the watermark bit within the average value. Such changes may be 
performed by bit mapper 923 (FIG. 9), thereby providing additional coding gain that may 
be advantageously employed to improve the reliability of the data at the receiver. 
However, doing so may cause a slight reduction in visual quality of low texture areas, 
because a few pixels within the block may have different values than their predecessors in 
the same location. However, because such reduction is at the pixel level, it is typically 
not noticeable. 

In one exemplary embodiment of the invention, groups of three time-consecutive 
frames are transmitted with the same watermark data being impressed thereon. The 
middle frame of the group is watermarked as described above in connection with FIG. 3, 
without changing the amount added to the average value of the selected chrominance 
portion of the block from the value determined in FIG. 3. 

The first-in-time frame of the group also has a value computed to be added, i.e., 
an offset bias, by bit mapper 923 (FIG. 9), to the average value of the selected 
chrominance portion of the block that is developed as described in connection with 
FIG. 3. However, in accordance with an aspect of the invention, the bias, e.g., one 
quarter or, preferably one half, of the absolute value of the value being added to the 
average to place the watermark bit within the average value, is additionally added to the 
computed average value of the chrominance portion selected to carry the watermark data. 
Thus, for example, if one is being added to the average value to place the watermark bit 
within the average value, then one half is added to the average value. This translates to 
adding 32 to the sum of the values of the selected chrominance portion of all the pixels of 
the block when there are 64 pixels in a block. Thus, summer 133 will received a higher 
value than it would have had the bias not been added. Similarly, as another example, if -4 
is being added to the average value to place the watermark bit within the average value, if 
one half of the absolute value of the value added to the average value is employed, this 
translates to adding 128 to the sum of the values of the selected chrominance portion of 
all the pixels of the block when there are 64 pixels in a block. 
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Note that this additional bias amount, e.g., 32, will be distributed throughout the 
various pixels based on their luminance variances. Also, this addition of the bias is 
independent of any value added to the average to bring it with a safe range. As a result, 
the average value may fall outside of the safe range. However, the increase in error 
probability engendered by moving out of the safe range is more than offset by the 
resulting coding gain resulting from employing the bias. 

The last-in-time frame of the group has a value computed to be subtracted, i.e., an 
offset bias, by bit mapper 923 (FIG. 9), from the average value of the selected 
chrominance portion of the block that is developed as described in connection with FIG. 
3. However, in accordance with an aspect of the invention, the bias, e.g., one quarter or, 
preferably one half, of the absolute value of the value being added to the average to place 
the watermark bit within the average value, is additionally subtracted from the computed 
average value of the chrominance portion selected to carry the watermark data. Thus, for 
example, if -3 is being added to the average value to place the watermark bit within the 
average value, then one half of the absolute value of -3, i.e., 1.5, is subtracted from the 
average value. This translates to subtracting 96 from the sum of the values of the selected 
chrominance portion of all the pixels of the block when there are 64 pixels in a block. 
Thus, summer 133 will received a lower value than it would have had the bias not been 
subtracted. Similarly, as another example, if 2 is being added to the average value to 
place the watermark bit within the average value, then one half of the absolute value of 2, 
i.e., 1, is subtracted from the average value. This translates to subtracting 64 from the 
sum of the values of the selected chrominance portion of all the pixels of the block when 
there are 64 pixels in a block. 

Note that the loss of the subtracted bias amount, e.g., 32, will be distributed 
throughout the various pixels based on their luminance variances. Further note that this 
subtraction of bias is independent of any value added to the average to bring it with a safe 
range. As a result, the average value may fall outside of the safe range. However, the 
increase in error probability engendered by moving out of the safe range is more than 
offset by the resulting coding gain. 

One way to think about how this works is to look at FIG. 5. As described 
hereinabove, without considering the bias amount oftentimes just enough is added to, or 
subtracted from, the average value of the selected chrominance portion of a block in order 
to reach one of the outer borders of the safe range. Thus, prior to any bias, many frames 
are on or near the border of the safe range. The middle frame to which nothing is added 
or subtracted remains right on the border. The frame to which a slight bias is added may 
move slightly to be better positioned within the safe range, or it may move slightly out of 
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the safe range. The frame from which a slight bias is subtracted moves in the opposite 
direction as the frame to which the bias is added. Thus, in the worst case, for a group of 
three frames one will be within the safe range, one will be on the border of the safe range, 
and one will be slightly out of the safe range. This results in an independent spread of 
values. 

The effect of the bias of the invention may be further magnified because of the 
quantization that is performed by MPEG-like encoding and the separate MPEG-bias that 
is added during the MPEG dequantization. This can result in significant differences in 
the received data values for like-positioned blocks within consecutive frames even when 
the same bit is transmitted over those consecutive frames. 

At the receiver, e.g., as shown in FIG. 10, the data extracted from each frame is 
weighted appropriately using maximum ratio combining based on a quality level that is 
believed to be present for each frame, e.g., in frame weighting unit 1027. To this end, 
sequence processor 1025 may supply to frame weighting unit 1027 a) frame 
synchronization information, so that frame weighting 1027 can know which frames are 
grouped together, and b) the number of errors in the synchronization pattern of each 
frame. The quality level is determined based on how many errors are believed to be in 
the received frame, which can be determined based on how many errors there are in the 
synchronization pattern that is expected for that frame, as extracted by sequence 
processor 1025. Table 1 shows a number of errors for each synchronization pattern and a 
respective weight that has been empirically derived to be appropriate for a frame with 
such a number of errors in its synchronization pattern. In other words, the values of the 
extracted data from each frame may be treated as soft data that is weighted by its 
associated weight as part of the combining process. 

Based on the weights, the multiple instances of the same data bit for 
corresponding block locations in successive frames are extracted and combined to form a 
single received bit. This may be achieved by computing 

bit _ out = (2 - 1) — 1 — ] - 1 — ^ 2 — 3_ ? w h ere 

O, +W 2 +W 3 ) 

bit out is the final output bit for the group of three frames; 

w\ 9 w 2 , and w 3 are the weights for each of the first, second and third in time 

frames; 

bit u bit 2 , and bih are the bits from the like-positioned block of the first, second 
and third in time frames; and 

n is the number of bits that the soft decoder input precision. 



D:\PATENTS\Zarrabizadeh 23\Zarrabizadeh 23.doc 



52 



Zarrabizadeh 23 



To best make use of the soft information, channel decoder 1021 is a so-called soft 
decoder that employs soft data bits, i.e., data bits that are each represented as a non binary 
number the range of which depends on the soft decoder input precision. For example, an 
8 bit input precision soft decoder operates with values between 0 and 255. To this end 

5 the weighted average of the received hard bits, — — ! — 2 - 3 - — 3 - , is multiplied by 

2"-l, thereby converting the weighted average into a soft value of the appropriate 
precision that can be processed by the soft decoder. 

In accordance with an aspect of the invention, when the determined quality of a 
particular frame is below a prescribed threshold, it may be assumed that the particular 

10 frame does not contain any watermarking data and no data is extracted for that frame. 

Those of ordinary skill in the art will readily recognize that which frame has the 
value added, which has it subtracted and which has no change; whether addition and 
subtraction are both necessary; the number of frames in a group; and any rounding to be 
performed on the value to be added or subtracted or the resulting value are at the 

1 5 discretion of the implementer. 
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